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Abstract. We consider the problem of characterizing entrywise functions that preserve the cone of 
positive definite matrices when applied to every off-diagonal element. Our results extend theorems 
of Schoenbcrg [Duke Math. J. 9], Rudin [Duke Math. J, 26], Christensen and Ressel [Trans. Amer. 
Math. Soc, 243], and others, where similar problems were studied when the function is applied to 
all elements, including the diagonal ones. It is shown that functions that arc guaranteed to preserve 
positive definiteness cannot at the same time induce sparsity, i.e., set elements to zero. These results 
have important implications for the regularization of positive definite matrices, where functions are 
often applied to only the off-diagonal elements to obtain sparse matrices with better properties (e.g., 
Markov random field structure, better condition number). As a particular case, it is shown that 
soft-thresholding, a commonly used operation in modern high-dimensional probability and statistics, 
is not guaranteed to maintain positive definiteness, even if the original matrix is sparse. This result 
has a deep connection to graphs, and in particular, to the class of trees. We then proceed to fully 
characterize functions which do preserve positive definiteness. This characterization is in terms of 
absolutely monotonic functions and turns out to be quite different from the case when the function 
is also applied to diagonal elements. We conclude by giving bounds on the condition number of a 
matrix which guarantee that the regularized matrix is positive definite. 



1. Introduction 

In one of his celebrated papers, Positive definite functions on spheres [Duke Math. J. 9, 96-108], 
I.J. Schoenberg proved that every continuous function / : (—1, 1) — >• M having the property that 
the matrix (/(oij)) is positive definite for every positive definite matrix (a^j) with entries in (—1, 1) 
has a power series representation with nonnegative coefficients. Functions satisfying this latter 
property are often known as absolutely monotonic functions. The aformentioned result has been 
generalized by Rudin [Duke Math. J, 26, (1959) 617-622] who showed that the class of absolutely 
monotonic functions fully characterizes the class of (not necessarily continuous) functions mapping 
every positive definite sequence to a positive definite sequence. Equivalently, the class of absolutely 
monotonic functions are exactly the functions mapping sequences of Fourier-Stieltjes coefficients to 
sequences of Fourier-Stieltjes coefficients. 

In this paper, we revisit and extend Schoenberg's results with important modern applications in 
mind. Positive definite matrices arise naturally as covariance or correlation matrices. Consider an 
n X n covariance (or correlation) matrix S. In modern high-dimensional probability and statistics, 
two of the most common techniques employed to improve the properties of S are the so-called hard- 
thresholding and soft-thresholding procedures. Hard-thresholding a positive definite matrix entails 
setting small off-diagonal elements of S to zero. This technique has the advantage of eliminating spu- 
rious or insignificant correlations, and leads to sparse estimates of the matrix E. These thresholded 
matrices generally have better properties (such as better conditioning) and lead to models that are 
easier to store, interpret, and work with. At the same time, in contrast with most "regularization" 
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techniques, this procedure incurs very little computational cost. Hence it can be applied to ultra 
high-dimensional matrices, as required by many modern-day applications (see [2j). 

An important property of thresholded covariance matrices that is generally required for applica- 
tions is positive definiteness. Nonetheless, regularization procedures such as hard-thresholding are 
often used indiscriminately, and with very little attention paid to the algebraic properties of the 
resulting thresholded matrices. It is therefore critical to understand whether or not the cone of pos- 
itive definite matrices is invariant with respect to hard-thresholding (and other similar operations), 
especially in order for these regularization methods to be widely applicable. 

We now formalize some notation. Given e > 0, the hard-thresholding operation is equivalent to 
applying the function : R — >■ R defined by 

^^-^^ -^^^^^^ " { othe'rwise 

to every off-diagonal element of the matrix S. As mentioned above, modern probability and statistics 
require that the thresholding function is applied only to off-diagonal elements. As a consequence, 
previous results from the mathematics literature cannot be directly used to determine whether hard- 
thresholding and other similar techniques preserve positive definiteness. The aim of this paper is to 
investigate this important question, especially given its significance in contemporary mathematical 
sciences. 

Algebraic properties of hard-thresholded matrices have been studied in detail in [2j, where it is 
shown that, even if the original matrix is sparse, hard-thresholding is not guaranteed to preserve 
positive definiteness. Thus the function does not map the cone of positive definite matrices into 
itself. 

A type of function that is equally frequently used in the literature is the so-called soft-thresholding 
function : M — t- M, given by 

(1.2) /f(x) = sgn(x)(|x| -e)+, 

where sgn(2;) denotes the sign of x and (a)+ = max(a, 0). Compared to hard-thresholding, soft- 
thresholding continuously shrinks all elements of a matrix to zero, thus giving more hope of preserv- 
ing positive definiteness than hard-thresholding. To the authors' knowledge, a detailed analysis of 
whether or not this is true has not been undertaken in the literature. It is also natural to ask whether 
the hard or soft -thresholding function can be replaced by other functions in order to induce sparsity 
(i.e., zeros) in positive definite matrices and, at the same time, maintain positive definiteness. 

The first theorem of this paper extends results from [2] and shows the rather surprising result 
that, for a given positive definite matrix, even if it is already sparse, there is generally no guarantee 
that its soft-thresholded version will remain positive definite. We state this result below: 

Theorem. Let G = {V, E) he a connected undirected graph and denote by Fq the cone of positive 
definite matrices with zeros according to G 

(1.3) P+ := {A = (a,,) € P+ : a,,- = if (ij) ^E,i^ j}, 

where P^ denotes the cone of all positive definite matrices. For e > 0, denote by 77e(vl) the soft- 
thresholded matrix 

(1.4) (r?,(A)),, = | f'^^ . 
^ ^ V /ev I otherwise 

Then the following are equivalent: 

(1) There exists e > such that for every A G P^, we have r]e{A) > 0; 

(2) For every e > and every A G P^, we have r]^{A) > 0; 

(3) G is a tree. 
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Following this result, we extend Schoenberg's results by fully characterizing the functions that 
preserve positive definiteness when applied to every off-diagonal element. The statement of the main 
theorem of the paper is given below. 

Theorem. Let < a < oo and let f : {—a, a) — t- M. For every matrix A = {aij), denote by f*[A] 
the matrix 



Then f*[A] is positive semidefinite for every positive semidefinite matrix A with entries in (—a, a) 
if and only if f{x) = xg{x) where: 

(1) g is analytic on the disc D(0,a); 



(3) g is absolutely monotonic on (0,a). 
When a = oo, the only functions satisfying the above conditions are the affine functions f{x) = ax 
forO<a<l. 

The above result does come as a surprise. It formally demonstrates that, except in trivial cases, 
no guarantee can be given that applying a function to the off-diagonal elements of a matrix will 
preserve positive definiteness. There are thus no theoretical safeguards that thresholding procedures 
used in innumerable applications will maintain positive definiteness. 

The remainder of the paper is structured as follows. Section [2] reviews results that have been 
recently established for hard-thresholding. In Section [3| a characterization of matrices preserving 
positive definiteness upon soft-thresholding is given. The characterization turns out to have a non- 
trivial relationship to graphs and the structure of zeros in the original matrix. Section |4] then studies 
the behavior of positive semidefinite matrices when an arbitrary function / is applied to every ele- 
ment of the matrix. A review of previous results from the literature is first given. The results are 
then extended to include the case where the function is applied only to the off-diagonal elements of 
the matrix. A complete characterization of functions preserving positive definiteness in this modern 
setting is given. Finally, Section [5] gives sufficient conditions for a matrix A and a function / so that 
the matrix f*[A\ remains positive definite. In particular, it is shown that the matrix f*[A] is guar- 
anteed to be positive definite as long as the condition number of A is smaller than an explicit bound. 

Notation: Throughout the paper, we shall make use of the following graph theoretic notation. 
Let G = (y,E) be an undirected graph with n > 1 vertices V = {1, . . . ,n} and edge set E. Two 
vertices a,b £ V , a ^ b, are said to be adjacent in G if (a, b) £ E. A graph is simple if it is undirected, 
and does not have multiple edges or self-loops. We will only work with finite simple graphs in this 
paper. 

We say that the graph G' = {V , E') is a subgraph of G = {V, E), denoted by G' C G, if C F 
and E' C E. In addition, if G' C G and E' = {V' x V') n E, we say that G' is an induced 
subgraph of G. A graph G is called complete if every pair of vertices are adjacent. A path of length 
k > \ from vertex i to j is a finite sequence of distinct vertices vq = i^ . . . ,Vk = j 'vn V and edges 
[vq, vi)^ . . . , (wfc-i, Vk) £ E. A k-cycle in G is a path of length k — 1 with an additional edge connecting 
the two end points. A graph G is called connected if for any pair of distinct vertices i,j £ V there 
exists a path between them. 

A special class of graphs are trees. These are connected graphs on n vertices with exactly n — 1 
edges. A tree can also be defined as a connected graph with no cycle of length n > 3, or as a 
connected graph with a unique path between any two vertices. 

Graphs provide a useful way to encode patterns of zeros in symmetric matrices by letting {i, j) S E 
if and only if Oij ^ 0. Denote by the cone of n x n symmetric positive definite matrices, and by 
the cone of positive definite matrices (of any dimension) . We shall write ^4 > whenever A G P"^ 
and A > B if A — B £ P"*". Similarly, we write ^4 > whenever A is positive semidefinite, and A > B 



(1.5) 




f{aij) ifi^j 
aij 'ifi=3 



(2) llfflloo < 1; 
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if ^ — i? > 0. We define the cone of positive definite matrices with zeros according to a given graph 
G with n vertices by 

(1.6) P+ := {A G P+ : a^, = if ^E,i^ j}. 

Denoting the space of n x n matrices by M^, recall that a (ni + 722) x (m + 112) symmetric block 
matrix 

A B 
D 



M 



where A G M^-^xnn 

B G 

Mmx?i2; Slid D G Mn2xn2) i'' positive definite if and only if D is positive 
definite and Si = A—BD^^B^ is positive definite. The matrix Si is called the Schur complement of D 
in M. Alternatively, M is positive definite if and only if A is positive definite and S2 = D — B^A~^B 
is positive definite. The matrix 5*2 is called the Schur complement of A in M. Finally, for a symmetric 
matrix A, we shall denote by Amin(^) and Amax(^) its smallest and largest eigenvalues respectively. 

2. Review of relevant results on hard-thresholding 

Algebraic properties of hard-thresholding have been studied in [2j. In particular, two types of 
hard-thresholding operations have been considered. Let G be a graph with n vertices. The graph G 
induces a hard-thresholding operation, mapping every symmetric nx n matrix A = {aij) to a matrix 
Ag defined by 

aij if (i, j) £ E OT i= j 
otherwise. 



(2.1) iAG)^, 



We say that the matrix Aq is obtained from A by thresholding A with respect to the graph G. 

The following result from [2j fully characterizes the graphs preserving positive definiteness upon 
thresholding. 

Theorem 2.1 ([21 Theorem 3.1]). Let A be an arbitrary symmetric n x n matrix such that ^4 > 0, 
i.e., A G P^. Threshold A with respect to a graph G = {V,E) with the resulting thresholded matrix 
denoted by Aq. Then 

T 

(2.2) Ag>0 for any AeF+ -^G = [J Gi for some r G N, 

i=l 

where Gi, i = 1, . . . ,t, denote disconnected, complete components of G. 

The above theorem asserts that a positive definite matrix A is guaranteed to retain positive defi- 
niteness upon thresholding with respect to a graph G only in the trivial case when the thresholded 
matrix can be reorganized as a block diagonal matrix where, within each block, there is no thresh- 
olding. This result can be further generalized to matrices in P^ which are thresholded with respect 
to a subgraph H of G. The following theorem shows that thresholding matrices from this class yields 
essentially the same results as in the complete graph case. 

Theorem 2.2 ([2, Theorem 3.3]). Let G = {V,E) be an undirected graph and let H = {V^E') be a 
subgraph of G i.e., E' C E. Then Ah > for every A G P^ if and only if H = Gi U ■ ■ ■ U G^ where 
Gi, . . . , Gk are disconnected induced subgraphs of G. 



Theorems [2.1 1 and 2.2 treat the case of thresholding elements regardless of their magnitude. In 
practical applications however, in order to induce sparsity, hard-thresholding is often performed on 
the smaller elements of the positive definite matrix. The following result shows that only matrices 
with zeros according to a tree are guaranteed to retain positive definiteness when hard-thresholded 
at a given level e > 0. 

Definition 2.3. The matrix B is said to be the hard-thresholded version of A at level e if bij = aij 
when \aij\ > e ov i = j, and bij = otherwise. 
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(a) Hard-thresholding (b) Soft-thresholding 

Figure 1. Illustration of the hard and soft-thresholding functions with e = 3 



Theorem 2.4 ([21 Theorem 3.6]). Let G be a connected undirected graph. The following are equiv- 
alent: 

(1) There exists e > such that for every A G Fq, the hard-thresholded version of A at level e is 
positive definite; 

(2) For every e > and every A G ¥q, the hard-thresholded version of A at level e is positive 
definite ; 

(3) G is a tree. 

The result above demonstrates that hard-thresholding positive definite matrices at a given level e 
can also quickly lead to a loss of positive definiteness, though it is not as severe as when thresholding 
with respect to a graph. Recall that hard-thresholding a matrix A at level e is equivalent to applying 



the hard-thresholding function given in (1.1) to every off-diagonal element of A. It is thus natural 
to replace the hard-thresholding function by other functions to see if positive definiteness can be 
retained. A popular alternative is the soft-thresholding function (see (1.2), (1.4), and Figure [T|. 
The next section is devoted to studying the algebraic properties of soft-thresholded positive definite 



matrices. We conclude this section by noting that Theorem 2.4 also yields a characterization of trees 
via thresholding matrices. 



3. SOFT-THRESHOLDING 

We now proceed to the more intricate task of characterizing the graphs G for which every matrix 
A G retains positive definiteness when soft-thresholded at a given level e > 0. As soft-thresholding 
is a continuous function as opposed to the hard-thresholding function, it would seem that soft- 
thresholding may have better properties in terms of retaining positive definiteness. 



Definition 3.1. For a matrix A 
given by: 



(aij) and e > 0, the soft-thresholded version of A at level e is 



(3.1) 



sgn{aij){\aij\ 



0,4 j 



if i / j 
otherwise 



Theorem 3.2. Let G = {V, E) he a connected undirected graph. Then the following are equivalent: 

(1) There exists e > such that for every A G P^, we have r]^{A) > 0; 

(2) For every e > and every A G P^, we have ri^{A) > 0; 

(3) G is a tree. 
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Remark 3.3. Regardless of the continuity of the soft-thresholding function Theorem 3.2 demon- 
strates that soft-thresholding has the same effect as hard-thresholding when it comes to retaining 
positive definiteness (see Theorem 2.4). The proof of Theorem 3.2 given below, however, is more 
challenging as compared to Theorem |2.4[ Theorem |3.2| also gives yet another characterization of 
trees. 

Proof of Theorem \3.S\ (1 =^ 3) We shall prove the contrapositive form. Let C„ denote the cycle 
graph with n vertices. Recall that a tree is a graph without cycle of length n > 3. Thus, if G is not 
a tree, then it contains a cycle of length greater or equal than 3. Therefore, to prove this part of 
the result, it is sufficient to construct, for every n > 3, a positive definite matrix An G which 
does not retain positive definiteness when soft-thresholded at the given level e > 0. We will begin 
by providing such examples of matrices for a fixed value of e = eo := 0.1. We will then show how 
matrices with the same properties can be built for arbitrary values of e > 0. 
The following matrix 



(3.2) 



A. 



9.0817 
1.1024 
1.1024 



1.1024 
0.23359 
0.10237 



1.1024 
0.10237 
0.14398 



provides an example for n 
where 

(3.3) 



3, with threshold level e = 0.1. Also, notice that 1) the matrix A^ 



A. 




1.1024 
0.23359 
0.10237 





0.10237 
0.14398 



which is ^3 with the (1, 3) and (3, 1) elements set to zero, is positive definite, and 2) the matrix A 
stays positive definite when only the (1,3) and (3, 1) elements are soft-thresholded at level e = 0.1. 
We will construct a similar matrix A^ for n > 4 inductively. Properties 1) and 2) will be important 
to perform the induction step. 

Indeed, assume that, for some n > 3, there exists a matrix A^ S which loses positive definite- 
ness when soft-thresholded at level e = 0.1. Let us assume also that the matrix An obtained from 
An by setting the {^,n) and (n, 1) elements to is positive definite and that the matrix obtained 
from An by soft-thresholding only the (l,n) and (n, 1) elements at level e is positive definite. These 
properties are satisfied for n = 3 by the matrix A3 given above. We will build a matrix An+i G P^ 
satisfying the same properties. Let o„ denote the (l,n) element of A^- For every real number r, let 
Te := sgn(r)(|r| — e)+ denote the value of r soft-thresholded at level e. To simplify the notation, let 
us denote by an,e the value of {an)e- Now consider the matrix 

/ 



(3.4) 












"1" Dn 











b 


a„+i ... 06 


a 



where Dn is a diagonal matrix with positive diagonal ((a„+i^e)^/a, 0, 0, (6^)^/0). Notice that 
An+i has zeros according to Cn+i- We will prove that a„+i, 6, a can be chosen so that An+i satisfies 
the required properties. 

Let us first choose the value of function of a and b in such a way that 



(3.5) 



a 
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This is always possible if |6| > e. Indeed, if |6| > e, then a„+i satisfies equation (3.5) for 
(3.6) 



On+l 



-a— h se 

Of 



where s = sgn{— aan,e/ be)- 

We claim that we can choose a > and b > e such that: 

(1) An+i is positive definite; 

(2) An+i is positive definite; 

(3) An+i is not positive definite when soft-thresholded at level e, i.e., r]e{An+i) ^ 0; 

(4) An+i is positive definite when only its (1, n + 1) and (n + 1, 1) elements are soft-thresholded 
at level e. 

Conditions (1) and (3) are the two conditions needed to prove that the matrix An+i satisfies the 
theorem. Conditions (2) and (4) are required in the induction step. 

First, note that the matrix ^n+i has been constructed in such a way that the Schur complement 
of a in r]e{An+i) is equal to r]e{An). Therefore, by the induction hypothesis, r]e{An+i) is not positive 
definite for any value of |6| > e and a > 0. This proves (3). 

Since a > 0, to prove properties (1), (2) and (4), we only need to study the Schur complement 
of a in the three matrices: A^+i, An+i and in the matrix obtained from An+i by soft-thresholding 
the + 1) and {n + 1,1) elements. We will prove that properties (1), (2) and (4) hold true 
asymptotically as q, 6 — )• oo. Therefore, the result will follow by choosing appropriately large values 
of a and b. 

The Schur complement of a in is given by 



(3.7) 



\ 





a-n+lb 


a ' ' 


a 


a„+ib 




a ' ' 


a 



I (an+i,.)^-a^_( 



V 



Let us take a = \? . Since a„+i and a depend on the value of b and since e = eo is fixed, b becomes 
the only "free" parameter. We will prove that properties (1), (2) and (4) hold for large values of b. 
We begin by studying the limiting behavior of different quantities related to the Schur complement 
We will show that 



(3.8) 
(3.9) 
(3.10) 



a 



a 



a 

On+lb 

a 



as 6 — )• oo 
as 6 — )• oo 
—an e as 6 — >• oo. 



Equation (3.8), follows easily by equation (3.5), since 
(3.11) 



an+l^e/ct = —ttn^e/be — )■ aS 6 — )■ OO. 



Now to prove |3.9| recall that, by construction, a„+i = an+i,e i e where the sign depends on the sign 

^'^ (fln+i.e)^ _ ±2ea„-|_i^, --^ 



of Un+i- Therefore 

(On+l)^ («n+l,€)^ (fln 



(3.12) 



a 



a 



+ l,e 

a 



+ 



a 



a 



a 



The first term tends to as 6 — )• oo as sho wn a bove. Also, since a = 6^,a— )-ooas6— t-oo and so 
e^/a — )• as 6 — )• oo. This proves equation (3.9). 
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Finally, for (3.10), if 5 > e then = b — e and 



a 



a 



a 

e ± e— lb — 

a a a 



a 



As we have seen above, a„+i_e/a — )• as & — )• oo. Also, b^/a — )• as 6 — )• oo since a = b^. Therefore, 

Qn+lb an+l,ebe 



(3.13) 







a 



a 



as 6 — )• oo. But by (3.5), we have an+i^ebe/cx = —an,e- Therefore 
(3.14) ^"+^^ ^ ■ 



a 



as 6 — )• OO. 



Using the results in equations (3.8)-(3.10), we now proceed to show that properties (1), (2) and 



(4) hold true for appropriately large values of b. To prove (1), we only need to show that the Schur 



complement given by (3.7) is positive definite for large values of b. Indeed, notice that from (3.9) 



and (3.10), we have 



(3.15) 



/ (a„+i,e)2-a: 

V 



2 

n+l 



a-n+lb 



J 



( 



V On,, 



/ 



elementwise. Therefore the Schur complement of a in An+i given in (3.7) tends to 

(0 ... a„,e ^ 
; ■•. ; 
On,e ••• y 

as 6 — )• oo. This matrix is exactly the matrix A„ with the (l,n) and (n, 1) elements soft-thresholded 
at level e. Therefore, by the induction hypothesis, this matrix is positive definite and so is An+i for 
large values of b. This proves property (1). 

To prove property (2) note that the Schur complement of a in An+i is given by 



(3.17) 



\ 







Notice that the (1, 1) entry of the righthand term is always positive whereas the (n, 1) element tends 
to as 6 — )■ oo. Since the matrix An is positive definite by the induction hypothesis, the Schur 
complement of a in An+i is therefore also positive definite when b is sufficiently large. This proves 
(2). 

Similarly, to prove (4), let us consider the Schur complement of a in the matrix A^+i with the 
(1, n + 1) and (n + 1, 1) entries soft-thresholded at level e 



/ a^+i-(an+i.O^ 



(3.18) 

We have 
(3.19) 



An + 



an+\,tb 



cx 

a / 



0'n+l,tb _ an+l,e{be + c) _ Qn+l.e^e _|_ ^ Qn+l,e 



a 



a 



a 



a 
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From (3.5) and (3.8), we therefore have 



(3.20) > -an,e 

a 

as 6 — >• oo and so the preceding Schur complement is asymptotic to the matrix with the {l,n) 
and (n, 1) elements soft-thresholded at level e. By the induction hypothesis, this matrix is positive 
definite and therefore the same is true for the matrix An+i with the (1, n + 1) and (n + 1, 1) entries 
soft-thresholded at level e when b is large enough. This proves (4). 

Consequently, a matrix An+i satisfying properties (1) to (4) can be obtained by choosing a value 
of b large enough. This completes the induction. Therefore, for every n > 3, there exists a matrix 
An G such that rye(^n) is not positive definite for e = eo = 0.1. 

Now let e > be arbitrary. Notice that for a > and any matrix A, it holds that 

(3.21) T]a,{aA) = arj,{A). 
As a consequence, for a given value of n, consider the matrix 

(3.22) A := —An. 

Then AG Ft since A„ G Pi . Moreover, 

(3.23) ^M) = r?^,„(^) = -r],,{An). 

Since r]eo{An) is not positive definite by construction, it follows that r]e{A) is not positive definite 
either. This provides the desired example of a matrix A G Pj such that r]^{A) is not positive definite. 
Therefore, if every matrix A G Fq retains positive definiteness when soft-thresholded at a given level 
e > 0, the graph G must not contain any cycle and so is a tree. 

(3 =^ 2) The implication in this direction holds for more general functions than the soft-thresholding 
function. The proof is therefore postponed to Section |4] (see Theorem 4.18). 



Finally, since 2 1 trivially, the three statements of the theorem are equivalent. This completes 
the proof of the theorem. □ 

Corollary 3.4 (Complete graph case). For every n > 3, and every e > there exists a matrix 
AgF+ such that ry,(A) P+. 

4. General thresholding and entrywise maps 

The result of the previous section shows that the commonly used soft-thresholding procedure does 
not map the cone of positive definite matrices into itself. A natural question to ask therefore is 
whether other mappings are better adept at preserving positive definiteness. 

In this section, we completely characterize the functions that do so when applied to every off- 
diagonal element of a positive definite matrix. We begin by introducing some notation and reviewing 
previous results from the literature for the case where the function is also applied to the diagonal. 

Definition 4.1. For a function / : M — > M, denote by: 

• f[A] the matrix obtained by applying / to every element of the matrix A, i.e., 

(4.1) (.f[A]),j = f{a,,)- 

• the matrix obtained by applying / to every element of the matrix A, except the 
diagonal, i.e.. 



(4.2) if [A]) 



f{aij) ifi^j 
an if i = j. 
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We now compare f[A] and f*[A] for ^ > 0. Clearly, f*[A] = f[A] + Da, where Da is the diagonal 
matrix 

(4.3) Da = diag(aii - /(an), . . . , a„„ - f{ann))- 

As a consequence, if f[A] > and the elements of Da are nonnegative, then f*[A] > 0. Such is the 
case when |/(x)| < 

Remark 4.2. The condition that |/(2;)| < is a mild restriction which allows us to conclude that 
f[A] > =^ f*[A] > 0. As we shall see below, the converse is generally false for matrices of a 
given dimension. Hence the previous results in the literature characterizing functions which preserve 
positive definiteness, when the function is also applied to diagonal elements, are unnecessarily too 
restrictive. In this sense, previous results in the field are not directly applicable to problems that arise 
in modern-day applications. This reasoning justifies in a major way our treatment of the problem of 
applying elementwise maps only to off-diagonal entries. 

4.1. Background material: Results for /[j4]. It is well-known that functions preserving positive 
definiteness when applied to every element of the matrix must have a certain degree of smoothness 
and non-negative derivatives. As we will see later, this is not true anymore when the diagonal is left 
untouched. 

Theorem 4.3 (Horn & Johnson [5j, Theorem 6.3.7). Let f be a continuous real-valued function on 
(0, oo) and suppose that f[A] is positive semidefinite for every n x n positive semidefinite matrix 
A = (oij) with positive entries. Then f is {n — 3) -times continuously differentiahle and f^''\t) > 
for every t £ (0, oo) and every k = 0, . . . ,n — 3. 

Corollary 4.4. The soft-thresholding operation is not guaranteed to preserve positive semidefinite- 
ness when the diagonal is also thresholded. 

Proof. This follows easily from the non-differentiability of the soft-thresholding function. □ 

Corollary 4.5. Let f be a continuous function and assume f[A] is positive semidefinite for every 
positive semidefinite matrix A with positive entries. Then f G C°°(0, oo) and f^^\t) > for every 
t € (0, oo) and every /c > 0. 



Corollary 4.5 provides a necessary condition for a function / to preserve positive definiteness 
when applied elementwise to a positive definite matrix. We shall show below that this condition is 
also sufficient. We first recall some facts about absolutely monotonic functions and the Hadamard 
product. 

Definition 4.6. Let < a < oo. A function / G C°°{0,a) is said to be absolutely monotonic on 
(0,a) if f^^^x) > for every x G (0, a) and every A: > 0. 

The following theorem characterizes the class of absolutely monotonic functions on (0,a). 

Theorem 4.7 (see ^ Chapter IV]). Let < a < oo. Then the following are equivalent: 

(1) / is absolutely monotonic on (0,a); 

(2) / is the restriction to (0,a) of an analytic function on D{0,a) := {z G C : \z\ < a} with 
positive Taylor coefficients, i.e., 

oo 

(4.4) /(x) = ^a„x" (xG(0,a)) 

n=0 

for some Cn > 0. 

Remark 4.8. Let < a < oo. A function / : {—a, a) — ?• M can be represented as: 



(4.5) f(oc) = J2an 



[—a < X < a) 

n=0 
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for some > if and only if / extends analytically to D{0, a) and is absolutely monotonic on (0, a). 

Recall that the Hadamard product (or Schur product) of two nxn matrices A and B, denoted by 
Ao is the matrix obtained by multiplying the two matrices entrywise, i.e., {A o B)ij = (aijbij). 
Since ^ o i? is a principal submatrix of the Kronecker product A<Si B, the matrix Ao B is positive 
definite if both A and B are positive definite. This last result is commonly known as the Schur 
product theorem. We now state the converse of Corollary |4.5[ The proof follows immediately from 
Theorem |4.7| and the Schur product theorem. 



Lemma 4.9. Let < a < oo and let / : (0,a) — t- M be absolutely monotonic on (0,a). Then f[A] 
is positive semidefinite for every positive semidefinite matrix A with entries in (0,a). 



Combining Corollary 4.5 and Lemma 4.9, we obtain the following characterization of functions 



preserving positive definiteness for every positive definite matrix with positive entries: 

Theorem 4.10. Let f : (0, oo) — t- M. Then f[A] is positive semidefinite for every positive semidefi- 
nite matrix A with positive entries if and only if f is absolutely monotonic on (0,oo). 

The following theorem shows that the result remains the same if the entries of the positive semi- 
definite matrix A are constrained to be in a given interval. Special cases of this result have been 
proved by different authors; we state only the most general version here. 

Theorem 4.11 (see Schoenberg [8j, Rudin [7j, Vasudeva [TDJ, Hiai [4j, Herz [3j, Christensen and 
Ressel |T]). Let < a < oo and let f : (—a, a) — )■ M. Then f[A] is positive semidefinite for 
every positive semidefinite matrix A with entries in {—a, a) if and only if f is analytic on the disc 
{z & C : \z\ < a} and absolutely monotonic on (0,a). 

Recall that one of the primary goals of regularizing positive definite matrices is to "induce sparsity" , 
i.e., set small elements to zero. The following result shows that no thresholding function that induces 
sparsity is guaranteed to preserve positive definiteness. 

Corollary 4.12. Let < a < oo and let f : {—a, a) — )• M satisfy /(O) = and /(7) = for some 
7 G (0, a). Assume f ^ on {—a, a). Then there exists a positive semidefinite matrix A with entries 
in {—a, a) such that f[A] is not positive semidefinite. 

Proof. Assume f[A] is positive semidefinite for every positive semidefinite matrix A with entries in 



{—a, a). Then, by Theorem 4.11 



(4.6) f{z) = Y,akZ>' {zeD{0,a)) 

k=l 

where = /W(0)//c! > 0. Since /(7) = 0, we must have = for every A; > 1, i.e., / = 0. Thus, 
if / ^ 0, there exists a positive semidefinite matrix A such that f[A] is not positive semidefinite. □ 

4.2. Preliminary results for Let ^ be a positive definite matrix. We now proceed to 

analyze mappings / that are applied only to off-diagonal elements of A. The next result provides a 
basic first constraint that / must satisfy in order for f*[A] to retain positive definiteness. 

Lemma 4.13. Let / : M — >• M and assume |/(^)| > \(,\ for some ^ G M. Then, for every graph 
G = {V, E) containing at least one edge, there exists a matrix A G such that f*[A\ is not positive 
semidefinite. 

Proof. Assume first that \ V\ = 2, and without loss of generality assume (1, 2) G E. Since |/(C)| > 
there exists e > such that \ f{£,)\ = |?| + Now consider the matrix 

(4 7) + i ^ 



12 



DOMINIQUE GUILLOT AND BALA RAJARATNAM STANFORD UNIVERSITY 



The matrix B is positive definite, but /* [B] is not positive semidefinite. The general case of a graph 
with n vertices follows by constructing the matrix A = B(Bln-2, where Ik denotes the kx k identity 
matrix. □ 

Recall from Theorem 4.2 that functions preserving positive definiteness when applied to every 
element of a matrix (including the diagonal) of a given dimension have to be sufficiently smooth, 
and have non-negative derivatives on the positive real axis. However, when the diagonal is left 
untouched, the situation changes quite drastically. More precisely, a far larger class of functions 
preserves positivity, as the following result shows. 



Proposition 4.14. Let G = {V,E) be a connected undirected graph and denote by A 
maximum degree of the vertices of G. Assume / : M — ?• M satisfies 



A{G) the 



(4.8) |/(a;)|<c|x| Vx G M, 
for some < c < ^. Then f*[A] G Fq for every A G P^. 

Proof. For every A G Fq, denote by Ma the matrix with entries 

(4.9) {MA)ij ' 



aij 



if aij and i j 
ifi= j 



if aij = and i / j 



The matrix f*[A\ can be written as 
(4.10) 



f*[A]=AoMA. 



Since < c < a,n application of Gershgorin's circle theorem demonstrates that Ma > 0. As a 
consequence, by the Schur product theorem, A o Ma > and so f*[A] > for every A G P^t. □ 

Corollary 4.15 (Complete graph case). Let n>2 and assume / : M — )• M satisfies 

(4.11) |/(a;)| < c|2;| Vx G M, 

for some < c < Then f*[A\ > for every n x n positive definite matrix A. 



Corollary 4.16. For every graph G, there exists a function f : 
A G 



such that f* [A] > for every 



Remark 4.17. Despite the simplicity of the above proofs (especially in constrast to Theorems 3.2 



4.18, and 4.21 of this paper), Proposition 4.14 Corollary 4.15 and Corollary 4.16 have important 
consequences, namely: 

(1) Contrary to the case where the function is also applied to the diagonal elements of the matrix 



see Theorem 4.3), Corollary 4.15 shows that, when the diagonal is left untouched, preserving 



every n x n positive semidefinite matrix does not imply any differentiability condition on /. 
Even continuity is not required. We therefore note the stark differences compared with 
previous results in the area. 



(2) Proposition 4.14 shows that preserving positive definiteness is relatively easier for matrices 
that are already very sparse in term of connectivity, i.e., matrices with bounded vertex degree. 



(3) Corollary 4.15 suggests that preserving positive definiteness for non-sparse matrices becomes 
increasingly difficult as the dimension n gets larger. 

4.3. Characterization of functions preserving positive definiteness for trees. Recall that 
a class of sparse positive definite matrices that is always guaranteed to retain positive definiteness 
upon either hard or soft-thresholding is the class of matrices with zeros according to a tree (see 



Theorems 2.4 and 3.2). A natural question to ask therefore is whether functions other than hard 



and soft-thresholding can also retain positive definiteness. Recall from Lemma 4.13 that for every 
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non-empty graph G, the functions / such that /* [A] € Fq for every A eFq are necessarily contained 
in the family 

(4.12) "rf := {f : R ^ R : |/(a;)| < |x| Vx G M}. 

Note that 'tf is the class of functions contracting at the origin. This "shrinkage" property is often 
required in practice. 

It is natural to ask if we can characterize the set of graphs G for which the functions mapping 
into itself constitute all of The following theorem answers this question. 

Theorem 4.18. Let G = iV^E) he a graph. Then 

(4.13) {/ : M ^ M : /*[^] e for every A G P^} = ^ 
if and only if G is a tree. 

Thus, the result provides a complete characterization of trees in terms of the maximal family 

Proof. (<^) Let G be a tree and assume |/(a:)| < \x\ for all x. We will prove that f*[A\ G P^ for 
every A G P^ by induction on n = \V\. Consider first the case n = 3. Then G is equal to the ^3 
graph with 3 vertices 





and A can be reconstituted as follows: 
(4.14) A 



with a\,a2,oiz > 0. Assume ^ > 0. By computing the determinants of the principal minors, the 
positive definiteness of A is equivalent to 

(4.15) a\{a2(y.^ — b"^) — 0.20^ = 0L\0L20L'i — a\}? — 0:20^ > 0. 
Since \f{x)\ < \x\, it follows that 

(4.16) 0:1(0203 - /(6)^) - a2f{af = 010203 - oiif{bf - 02/(0)^ 

(4.17) > 01O2O3 — oi6^ — 020^ > 0, 

and so f*[A] > 0. The result is therefore true for n = 3. 

Assume the result is true for every tree with n vertices and consider a tree G with n + 1 vertices. 
Let G be a sub-tree obtained by removing a vertex connected to only one other node. Without loss 
of generality, assume this vertex is labeled n + 1 and its neighbor is labeled n. Let ^ G P^. The 
matrix A has the form 

/ 



(4.18) 



A 





\ 







A 











a 


... a 


a J 



By the induction hypothesis, the n x n principal submatrix A of A stays positive definite when 
/ is applied to its off-diagonal elements, i.e., f*[A] > 0. It remains to be shown that the Schur 
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complement of a in is positive definite. Note first that the Schur complement of a in ^ is 

given as: 







(4.19) 



S = A 







Since by assumption A > 0, we have 5" > 0. We also have S G P~. Therefore, by the induction 

G 

hypothesis, f*\S\ > 0. Note that f*[S] is different from the Schur complement of a in More 
specifically, 

/ ... 

(4.20) f[s] = f*[A]- : ••. : 

\ ••• 77 

\ a 

but the Schur complement of a in /* [A] is given by 

/o . 

(4.21) r[A]- ; • 

Vo . 

Since |/(a;)| < |x| by assumption, it follows that the Schur complement of a in f*[A] is equal 
to f*[S] + D where D is a diagonal matrix with non-negative entries. Since f*[S] > 0, the Schur 
complement of a in f*[A] is positive definite, and therefore, so is /*[^]. This completes the induction. 



(=^) Conversely, assume now that G is not a tree and let e > 0. Then, by Theorem 3.2, there 
exists a matrix A G such that {f^)*[A] P^, where denotes the soft-thresholding function 
(see (1.2)). This concludes the proof. □ 



Remark 4.19. A similar result also holds for hard-thresholding with respect to a graph. Indeed, 
note that every subgraph of a graph G is a union of disconnected induced subgraphs if and only if 
G is a tree. As a consequence, matrices in P^ are guaranteed to retain positive definiteness when 



thresholded with respect to any subgraph of G if and only if G is a tree (see Theorem 2.2 and 
[21 Corollary 3.5]). Hence, trees can be characterized by all four types of thresholding operations 
that have been considered: 1) graph thresholding, 2) hard-thresholding, 3) soft-thresholding, and 4) 
general thresholding. 



Remark 4.20. Corollary 4.16 shows that the converse of Theorem 4.18 is false in general, in the 
sense that retaining positive definite matrices for every matrix in P^ does not imply that G is a tree. 

4.4. Proof of the main result. We now proceed to completely characterize the functions / preserv- 
ing positive definiteness for matrices of arbitrary dimension, when the diagonal is not thresholded. 



-7- M. Then f*[A] is positive semidefinite for 
a, a) if and only if f{x) = xg{x) where: 



Theorem 4.21. Let < a < oo and let f : {—a, a 
every positive semidefinite matrix A with entries in 

(1) g is analytic on the disc D(0,a); 

(2) llfflloo < 1; 

(3) g is absolutely monotonic on (0,a). 

When a = oo, the only functions satisfying the above conditions are the affine functions f{x 
forO<a<l. 



ax 



Proof. (=^) Let n > 2 and let A, B be two n x n positive semidefinite matrices with entries in (—a, a) 
such that A>B. We first claim that f*[A] > 2f[B] - f*[B]. Consider the block matrix 



(4.22) 



M 



A 
B 



B 
B 



A-B 




+ 



B 
B 



B 
B 



> 0. 
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Since M is positive semidefinite, /*[M] is also positive semidefinite by hypothesis. Thus 

riA] f[B] 



(4.23) 
In particular, 



f*[M] 



(4.24) 



/ -/ 




f*[A] f[B] 
f[B] riB] 



f[B] r[B] • 

/ ^ f f*[A]-f[B]-f[B] + f*[B] 



-/ 











> 0. 



Thus, we must have f*[A] - 2f[B] + f*[B] > 0, i.e., f*[A] > 2f[B] - f*[B]. This proves the claim. 

Now, denote by Im the mx m matrix with every entry equal to 1. Since A> B, we also have, for 
every m > 3, that Im'i^ A> Im'i^ B, i.e., 



(4.25) 



f A A 
A A 



A\ [ B B 
A B B 

a) 



> 



B \ 
B 



\ B B .., 



B J 

B]. 



\ A A .. 

Applying the claim to the above matrices, it follows that 

(4.26) r[lm(^A]>2f[lrn(E)B]-r[U 
Let Db denote the diagonal matrix 

(4.27) Db = diag {bn - f{bn), . . . , &nn - /(6„„)) . 
Then f*[B] = f[B] + Db and so, denoting by Im the m x m identity matrix, 

(4.28) 2/[l„ C^B]- r [U ^B] = lm<^ f[B] - ® Db. 
Similarly, if 

(4.29) Da = diag (an - /(an), . . . , a„n - f{ann)) , 
then f*[A] = f[A] + Da and 

(4.30) /* [Im 0A] = lm'S)f[A] + Im0DA. 



Im ® f[A] + Im(S) DA>lm(S) f[B] -Im® Db, 



Combining equations (4.26), (4.28) and (4.30), we obtain 
(4.31) 
or equivalently, 

(4.32) Im ® {f[A] - f[B]) > -Im ® {Da + Db). 



In matrix notation, equation (4.32) is equivalent to 

m]-f[B] \ 

f[A]-f[B] 



(4.33) 



/ f[A]-f[B] f[A]-f[B] 
f[A]-flB] f[A]-flB] 



> 



\ f[A]-f[B] f[A]~f[B] ... f[Ayf[B] J 



( Da+Db 

Da+Db 

\ ^ 



Da+Db J 



Note that for every nxn symmetric matrix M with eigenvalues Ai, . . . , A^, the eigenvalues ollm®M 

Q 

/(^n)). 



are {0, mAi, . . . , mA„}. Now apply Weyl's inequality to (4.32) to obtain 

(4.34) m\raUf[A] - f[B]) > - max(aii + bu - /{an 

Dividing by m and letting m — )• oo shows that 

(4.35) f[A]-m>Q. 

We have thus shown that f[A\ > f[B] for every A > B > {) with entries in (—a, a). Since, by 
hypothesis, f*[A\ > for every A > 0, we know from Lemma 4.13 that / has to satisfy |/(x)| < 
for every x. In particular, /(O) = 0. Applying (4.35) with B = now shows that f[A] > for every 
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A>0. Hence, by Theorem 4.11, / is analytic on D{0,a) and is absolutely monotonic on (0,a) 
/('=)(0) > for every A; > 0. In other words, 



I.e., 



(4.36) 



m = E 



akZ 



k=l 



where := f^^\0)/kl > 0. Finally, since / satisfies < 
defined by ^(O) = and 



X 



(see Lemma 



4.13), the function g 



(4.37) 



9[x) 



^atx^ 1 (x/O), 
fc=l 

satisfies \g{x)\ < 1 for every x, i.e., ||5'||oo ^ !• Therefore, f{x) — xg{x) for a function g that is 
analytic on Z?(0,a), absolutely monotonic on (0,a), and satisfies the condition ||(7||oo 

< 1. 

(<;=) Conversely, assume f{x) = xg{x) for some function g analytic function on D{0, a), absolutely 
monotonic on (0,a), and satisfying \\g\\oo < 1- Then from Theorem 4.11 f[A] > for every A>0 



with entries in {—a, a). Note that f*[A] = f[A] + D where D is the diagonal matrix 
(4.38) D = diag(aii - /(an), . . . , a„n - /(arm))- 

Since ||g||oo < 1; then |/(x)| < and thus the elements of D are non-negative. Hence, f*[A] > 
for every ^ > with entries in (—a, a). 

In the case when a = oo, the only bounded absolutely monotonic functions g on (0,oo) are the 
constant functions g{x) = a for some a > 0. Since \f{x 
completes the proof of the theorem. 



< Ixl we must have < a < 1. 



This 
□ 



Remark 4.22. We now compare and constrast Theorem 4.21 to the analoguous results proved in 



the literature, namely Theorem 4.11 



(1) Recall that, for matrices ^ > of a given dimension, if |/(2;)| < \x\ and f[A] > 0, then 



f*[A] > 0. The converse is generally false (see Remarks 4.2 and |4.17). 



(2) In constrast, for ^ > of arbitrary dimension, the above proof shows that f[A] > and 
|/(x)| < |x| if and only if f*[A] > 0. 



Theorem |4.21 shows that only a very narrow class of functions are guaranteed to preserve positive 
definiteness for an arbitrary positive definite matrix of any dimension. In practical applications, 
thresholding is often performed on normalized matrices (such as correlation matrices) which have 
bounded entries. In that case, more functions preserve positive definiteness. However, as in the 
case where the function is applied to the diagonal, the following result shows that no thresholding 
function can induce sparsity (i.e., set non-zero elements to zero) and, at the same time, be guaranteed 
to maintain positive definiteness for matrices of every dimension. 

Corollary 4.23. Let < a < oo and let f : {—a, a) — ?• M satisfy /(O) = and f{'y) = for some 
7 E (0, a). Assume f ^ on {—a, a). Then there exists a positive semidefinite matrix A with entries 
in {—a, a) such that f*[A] is not positive semidefinite. 



Proof. The proof is the same as the proof of Corollary 4.12 



□ 



5. Eigenvalue inequalities 

The results of Section [4] show that only a restricted class of functions are guaranteed to preserve 
positive definiteness when applied elementwise to matrices of arbitrary dimension. Moreover, no 
function can at the same time induce sparsity (have zeros other than at the origin) and simultaneously 
preserve positive definiteness for every matrix. Hence, a natural question to ask is whether certain 
properties of matrices (such as a lower bound on the minimum eigenvalue or an upper bound on the 
condition number) are sufficient to maintain positive definiteness when a given function / is applied 
to the off-diagonal elements of the matrix. We provide such sufficient conditions in this section. The 
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results are first derived in Section 5.1 for the case when / is a polynomial. They are then extended 



to more general functions in the subsequent subsection. 



ajX 



5.1. Bounds for polynomials. We first establish some notation. For a polynomial p(x) = ^^^q 
define its "positive" and "negative" parts by: 

(5.1) p+{x)=^^aix\ p-{x) = - aix\ 

ai>0 ai<0 

Many of the results in this section are motivated by the following idea. Note that 

(5.2) p*[A]=p+[A]-p^[A]+Da 

where D is the diagonal matrix Da = diag(aii — p(aii), . . . , ann — piann))- Repeated applications of 
the Schur product theorem can be used to show that both p+ [A\ and p- [A\ are positive definite when 
A is positive definite. Intuitively, a polynomial with a positive part that is "larger" than its negative 
part should be able to preserve positive definiteness for a wider class of matrices as compared to a 
polynomial with a "large" negative part. This idea is formalized in Proposition |5.3| below. Before 
stating the result, recall the following classical result that can be used to bound the eigenvalues of 
Schur products. 

Theorem 5.1 (Schur [9j). Let A, B he positive definite matrices. Then for i = 1, . . . , n, 

(5.3) Xrmn{A) mm On max On . 

i i 

Corollary 5.2. Let A he an n y. n matrix. Denote hy dmin cind dmax the minimal and maximal 
diagonal element of A respectively. Then for i = 1, . . . ,n, 

(5.4) rf™nA™n(^) < A,(^°'=) < dt^^XmaM), 

where A°^ denotes the Hadamard product of k copies of A. 

We now proceed to state the main result of this subsection. 

Proposition 5.3. Let p be a polynomial and assume p{0) = 0. Let A be annxn matrix, and denote 
by dmin and dmax the smallest and largest diagonal elements of A respectively. Then: 

(1) XminiP*[A]) > p+{Xmin{A)) - p_{Xm,ax{A)) + mm {on - p{aii)) ; 

i=l,...,n 

(2) Xmin{P*[A]) > Amm(^) ^^j^'"'"'* - Amax(^) ^"^^'""'^^ + . min {ou - p{aii)) . 

dmin 0"max i—l,...,n 

Proof. By Weyl's inequality, 

(5.5) A„,in(p[^]) > a^^min{An + ^ a,An,ax(^°') ■ 

ai>0 ai<0 

The first result follows by using Cauchy's interlacing theorem to observe that Amin(^°*) > Amm(^)\ 
Amax(^°*) < Amax(^)*, in addition to the fact that p*[A] = p[A] + Da, where Da is the diagonal 
matrix Da = diag(aii— 7j(oii), . . . , ann—p{ann))- The second assertion follows by the same argument, 
but then uses Corollary |5.2| to bound the eigenvalues of the Schur product. □ 



Corollary 5.4. Let n>l and let p he a polynomial such that p(0) = 0. Let A he an n x n matrix 
with spectral radius denoted hy p{A) . Then p* [A] > if 

(5.6) mm {an - p{aii)) > p-{p{ A)). 

1=1,... ,n 

The following surprising result shows that some polynomials having negative coefficients can pre- 
serve large classes of positive definite matrices (like the class of correlation matrices). 

Corollary 5.5. Let n > 1 and let p he a polynomial such that p{0) = and P-{x) < 1 — p{l) for 
every < x < n. Then p*[A] > for every n x n correlation matrix A. 
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Proof. This follows from Corollary 5.4 by noticing that Aniax(^) < n for every n x n correlation 



matrix A. □ 



Corollary 5.6 below shows that p*[A] is guaranteed to be positive definite if the condition number 
of A is sufficiently small. Note that the bound becomes more restrictive as the "negative part" of p 
becomes larger compared to its "positive part" . 

Corollary 5.6. Let p be a polynomial and assume \p{x)\ < \x\ for every x G [—a, a], for some a > 0. 
Then p*[A] > for every positive definite matrix A with entries in [—a, a] such that 

/r fT\ J / /I ^ ^maxi,-^) ^ P+\dmin) dmax 

(5.7) cond(^) := — - < 



Xmin{A) p-{d max) dmin 

Corollary 5.7. Let p be a polynomial and assume \p{x)\ < \x\ for every x G [—1, 1]. Then p*[A] > 
for every correlation matrix A such that 

(5.8) cond(A) := ^ < ?±ffl. 



5.2. Extension to more general functions. We now proceed to extend the results of Section 5.1 
to more general thresholding functions. We first recall the following well-known result. 

Lemma 5.8. Let be the set of polynomials with positive coefficients and let r > 0. Then 
the uniform closure of over [— r, r] is the restriction to [—r, r] of the set of analytic functions 
f{^) — Yln>o^nz"' on the disc D{0,r) = {z £ C : \z\ < r} with On > for every n > and 
E„>oanr" < oo. 

Definition 5.9. For r > we define 

(5.9) W+{r) = i f{z) = a„z" G Hol(D(0, r)) : ^ |a„|r" < oo 

[ n>0 n>0 

The space := W~^{1) is often known as the analytic Wiener algebra of analytic functions. The 
space W~^{r) can be seen as a weighted version of the analytic Wiener algebra. 

As mentioned above, every function in W~^{r) has a continuous extension to the closed disc D{0, r). 
Notice that W~^{r) C H'^{D(0,r)), the space of bounded analytic functions on the disc D{0,r). 



We first begin by extending Proposition 5.3 



Proposition 5.10. Let f G W~^{a) for some a > and assume /(O) = 0. Write f = f^ — f- where 
f+,f- £ W^{a) have nonnegative Taylor coefficients. Then for every n x n matrix A = (aij) with 
Oij G [—a, a], we have: 

(1) ^m^n{f*[A]) > /+(A™„(A)) - f-{Xmax{A)) + min (a,, - fiau)); 

i=l,...,n 

(2) Xm^n{f*[A]) > \^,^{A)^-±f^ - Xmax{A) ^~f""'"^ + .min {ou - f{au)). 

d-min Jmax i—l,...,n 



(k) (k) 

Proof. By Lemma 5.8, there exist sequences of polynomials with positive coefficients p\ , p_ such 



that 



(5.10) ^ /+ aiid P f- 

uniformly on [—a, a] as /c — )• oo. Let p'^'^^ := — p^^K An application of the triangle inequality 
shows that p^^^ — t- / uniformly on [—a, a]. Now, by Proposition 



5.3 



(5.11) An.in(p('=)[A]) > (A^in(.4)) - pL^^ (A^ax(^) ) + min {au - p^^'Hau)) ■ 

i=l,...,n 

The first result follows by the continuity of the eigenvalues and uniform convergence. The second 
part follows similarly. □ 
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Corollary 5.6 can also be easily extended using a similar argument. 

Theorem 5.11. Let f G W~^{a) for some a > and assume \ f{x)\ < \x\ for every x € [—a, a] . Then 
f*[A] > for every n x n matrix A = (aij) with Oij G [—a, a] such that 

/cioN J / A\ ^ f+{dmin) dmax 

(5.12) cond(yl) < — — r- , 

J—yflraax) 0"min 

where dmin cLnd dmax denote the minimal and maximal diagonal element of A respectively. 

The previous results can easily be extended to more general functions that can be approximated 



pointwise by polynomials. The following result illustrates this idea for Corollary 5.6 



Theorem 5.12. Let f : [—a, a] — )• M for some a > and assume < |x| for every x G [—a, a]. 

Moreover, assume f is the pointwise limit of a sequence of polynomials p^"^ on [—a, a] . Then f* [A] > 
for every A > with entries in [—a, a] such that 

/r- io\ J 4 ^ T p\-\dmin) dmax 

(5.13) cond A < lim sup ^!^T ; , 

where dmin cind dmax denote the minimal and maximal diagonal element of A respectively. 

Proof. Since ^("^[A] converges to f[A\ entrywise as n — )• oo, the eigenvalues of p'^")[j4] converge to 



the eigenvalues of f[A\. The result follows from Corollary 5.6 using a limiting argument similar to 



the one in the proof of Proposition 5.10 □ 
Acknowledgment. We wish to thank Apoorva Khare for useful comments and discussions. 
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